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We study dynamical error suppression from the perspective of reducing sequencing complexity, 
in order to facilitate efficient semi-autonomous quantum-coherent systems. With this aim, we focus 
on digital sequences where all interpulse time periods are integer multiples of a minimum clock 
period and compatibility with simple digital classical control circuitry is intrinsic, using so-called 
Walsh functions as a general mathematical framework. The Walsh functions are an orthonormal 
set of basis functions which may be associated directly with the control propagator for a digital 
modulation scheme, and dynamical decoupling (DD) sequences can be derived from the locations 
of digital transitions therein. We characterize the suite of the resulting Walsh dynamical decoupling 
(WDD) sequences, and identify the number of periodic square- wave (Rademacher) functions required 
to generate a Walsh function as the key determinant of the error-suppressing features of the relevant 
WDD sequence. WDD forms a unifying theoretical framework as it includes a large variety of well- 
known and novel DD sequences, providing significant flexibility and performance benefits relative to 
basic quasi-periodic design. We also show how Walsh modulation may be employed for the protection 
of certain nontrivial logic gates, providing an implementation of a dynamically corrected gate. Based 
on these insights we identify Walsh modulation as a digital-efficient approach for physical-layer error 
suppression. 

PACS numbers: 03.67.Pp, 03.65.Yz, 37.10.Ty 



I. INTRODUCTION 

Dynamical quantum error correction has been pro- 
posed as a strategy by which arbitrarily accurate evo- 
lutions may in principle be implemented in a large class 
of open quantum systems. This approach involves ap- 
plication of open-loop control protocols at the physical 
level [TMl6|; through time-dependent modulation of the 
system's dynamics, the effects of an environment which 
fluctuates sufficiently slowly are coherently averaged out. 
Dynamical decoupling (DD) is an experimentally vali- 
dated [T7H55] subclass of these protocols specifically tai- 
lored to the task of suppressing decoherence during the 
implementation of the identity operator - resulting in 
improved quantum storage. 

DD takes physical inspiration from the spin echo in 
nuclear magnetic resonance (NMR) 34-36J, and relies 
on deterministic (periodic [T] or aperiodic [HI EE]), or 
even random [37J 13H] modulation of the idle system 
via pulsed control. A proliferation of analytical for- 
malisms and new DD schemes has appeared in the liter- 
ature, in particular for the paradigmatic case of a single 
qubit exposed to pure (classical and/or quantum) de- 
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phasing pj EH EQl EH EM!]. while an understanding 
of the performance of these sequences may be unified by 
the application of a noise filtering framework [7J 1161 1451 - 
ST], each sequence brings particular requirements for the 
necessary pulse timings, often incorporating nonintuitive 
analytical expressions or numerical search to define pulse 
locations in a sequence. As a result, the generation of 
DD pulse sequences at the lab bench is usually accom- 
plished using specially programmed microcontrollers or 
a PC under user control. 

In this work, we address the problem of control com- 
plexity in dynamical error correction, introducing a set of 
digital DD protocols optimized for hardware compatibil- 
ity and minimization of sequencing complexity. These 
protocols are based on the Walsh functions [15J 135] , 
which take binary values and are composed of products 
of square waves, forming an orthonormal basis similar 
to the sines and cosines. The Walsh functions bene- 
fit from compact notation and a uniform mathematical 
basis for sequence construction. We describe the error- 
suppressing properties of Walsh modulation and Walsh 
dynamical decoupling (WDD), and introduce a quanti- 
tative metric for sequencing complexity, r, the number of 
Rademacher square-wave functions which must be mul- 
tiplied (or added mod-2) in hardware to generate the 
control propagator for that sequence. The Rademacher 
functions in turn can be trivially generated via ele- 



2 



mentary digital circuits synchronous with a distributed 
clock signal. Thus, by construction Walsh modulation 
is highly compatible with simple digital sequencing cir- 
cuitry and digital clocking as may be needed in large 
or semi-autonomous systems. This approach therefore 
provides efficient error suppression, important for real- 
world implementations beyond simple demonstration ex- 
periments. 

The remainder of this manuscript is organized as fol- 
lows. After introducing the relevant system and control 
setting in Section [TTJ Section |III| is devoted to describ- 
ing the mathematical formulation and key features of 
the Walsh functions, and their natural use in defining 
WDD schemes. In this section, our main results are es- 
tablished, including an exact relationship between the 
sequencing complexity r and the order of decoherence 
suppression in the perturbative limit. Here, we also de- 
scribe the entire taxonomy of WDD sequences, discussing 
relationships between WDD and well-known sequences, 
as well as characterizing new DD sequences with digital 
timing. In Sec. |IV[ we discuss extensions of the WDD 
formalism beyond the simplest setting of a single qubit 
exposed to classical dephasing noise. In particular, we 
show how two-axis generalizations of WDD naturally re- 
cover and expand existing concatenated DD schemes, and 
discuss how Walsh modulation allows enhancement of the 
fidelity of nontrivial gate operations, examining a recent 
trapped- ion experiment as an example [50] . We follow 
this with an analysis of the explicit benefits of WDD 
over other optimized DD approaches in Sec.|Vj and close 
with a brief summary and outlook. A proof of WDD er- 
ror suppression properties from Rademacher functions is 
included in Appendix A for completeness. 



II. PHYSICAL SETTING 

DD is applicable to a variety of non-Markovian error 
models (including arbitrary many-qubit systems interact- 
ing with quantum environments, [21 151)) and can address 
non-ideal pulses or continuous control scenarios (as in 
Eulerian DD [5]). Our starting point here, however, is 
the simplest yet practically important case of a single 
qubit subject to classical phase noise and controlled with 
ideal it pulses. In this setting, the qubit is affected by an 
undesired noise Hamiltonian, 



(1) 



where j3(t) is a stochastic process and Ci denote the Pauli 
matrices for i = x, y, z. We assume that the system 
can be controlled by application of idealized sequences 
of qubit rotations, each corresponding to a tt rotation 
around the x axis, i.e., to the unitary operator o~ x . Each 
sequence is characterized by the pulse timings {tj}j =1 , 
where io = and t s+ i = r denote the initial (prepa- 
ration) and final (readout) times respectively. We use 
6j = tj/r to denote the "normalized pulse locations" and 



the "pulse pattern" p — {£j}J =1 , to distinguish among 
various sequences with the same running time |52j . 

During the evolution, the n pulses implement the con- 
trol propagator 



U c (t) 



r [y(t) + l]/2 



where y(t) takes values ±1 and switches instantaneously 
between these values at times corresponding to applica- 
tion of the 7r pulses. For brevity, we shall refer to y(t) 
as the "sequence propagator" in what follows. The "fil- 
ter function" F p (ujt) associated with the pulse pattern 
p = {Sj} and sequence duration r is defined in terms of 
ji(wr), the Fourier transform of the sequence propagator 
SOl [461: 



F p (cjt) = u) 2 \y(uT)\< 



^{-\y{e lS ^ T 

3=0 



(2) 
(3) 



The case of free evolution (also referred to as free in- 
duction decay, FID, in NMR terminology) is formally 
included by letting s = 0. 

The presence of the noise Hamiltonian ([!]) implies a co- 
herence loss at the readout due to phase randomization, 
resulting from the ensemble average with respect to noise 
realizations. The action of DD is to break the system's 
evolution into a sequence of interactions with the envi- 
ronment with alternating signs, resulting in significantly 
reduced phase accumulation at the end of the sequence 
[TJ[2] . This may be effectively represented as the expected 
value of the convolution of the stochastic noise term with 
the sequence propagator [TU], [151 133 HOI SS] ■ In particu- 
lar, the filter function provides a compact exact expres- 
sion for the coherence decay under Gaussian noise; if the 
system is prepared in a superp osition of eigenstates of 
a z , its coherence W = |(ct + )(t)| decays as e~ Xp ^ T \ where 



X P (r) 



F p {wT)duj, 



(4) 



and Sp{uj) is the power spectrum of the noise (3(t). 
While nominally the integration range in Eq. Q is infi- 
nite, in practice Sp(u>), and thus the "spectral measure" 
A(cj) = S/3(uj)/2ttuj 2 [44], is significant only for frequen- 
cies smaller than an ultraviolet cut-off frequency u c . The 
filter function can also be employed for a qubit coupled to 
a purely-dephasing quantum bosonic environment with a 
similar expression for decoherence under DD sequences 
[TJ [TU] ■ The effect of continuous control can also be ap- 
proximated within the filter function formalism as long 
as terms that are of second order and higher in H no - lse are 
ignored [Hill] (see also Sec. |IV| . 

The filter function enters the integrand in Eq. Q as a 
multiplicative factor of A(w), therefore, as long as F p (lut) 
is small for uj < uj c , the coherence loss will also remain 
small, as desired. Determining the actual value of Xp( T ) 
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requires a detailed knowledge of Sp(bj), yet we may com- 
pare the coherence associated with various DD sequences 
(including free evolution) by directly comparing their cor- 
responding filter functions over the interval [0,w c r]. For 
all sequences the filter function vanishes at zero, however 
we may differentiate the low-frequency behaviors (rates 
of growth) among various timing patterns. Let the low- 
frequency behavior of the filter function be given by 

F p {ur) oc {ujt) 2 ^ +1 \ (5) 

corresponding to 

ji(wr) oc (wr) a . 

In the language of filter design [IB], Eq. ^fy, defines a 
"highpass filter" with a "rolloff" of 6(a + l)dB/octave. 
As long as the cutoff frequency u> c is sufficiently small, the 
low-frequency behavior of the filter function translates to 

1 — W cc (w c t) 2 " +1 . 

For example, the filter function for free evolution 
i ? { ,i}( WT ) = 2sin 2 (wr/2) corresponds to a = 0, while 
high-order DD sequences have larger positive a's. The 
frequency range over which the filter function suppresses 
noise is given by the "bandwidth" f2 p , roughly defined as 
the largest frequency below which F p (ojt) < 1, a value 
approximately commensurate with the value u>fi intro- 
duced in previous analyses of the filter function [16]. 
Large values of the bandwidth Q p improve the high- 
frequency robustness of a particular DD sequence and 
allow it to be used with comparatively lower pulse rates. 

III. DYNAMICAL DECOUPLING BY DIGITAL 
MODULATION 

Many previously developed schemes for DD rely on ac- 
cess to sequences defined in continuous time, which is a 
good approximation for most benchtop experiments con- 
ducted today. In the long term, however, where large 
systems with error rates deep below the fault-tolerance 
threshold are required, these approximations will cease 
to provide an accurate estimate of residual error rates. 
Previous work has shown that the benefits of optimized 
sequences such as UDD become diminished in such cases 
where we expect the use of digital control and discretized 
time [16] . We are thus motivated to find digital modula- 
tion schemes that are both intrinsically compatible with 
discrete time and with the digital control hardware that 
will inevitably be employed in sequencing. 

Wc define two relevant timescales for such a modula- 
tion scheme; the total running time r, and a minimum 
interval (minimum switching time [43]) given by 

' mm ' / j 

where 2 m is the largest possible number of free-evolution 
periods in an applied sequence. In practice, r m j n is 



bounded from below by technological constraints such 
as modulation rates or hardware clock speeds. In the 
case of digital modulation, all interpulse periods must be 
defined as integer multiples of T m i n . This in turn places 
constraints on the allowed values of 5j in a sequence. Ir- 
rational values of fractional pulse locations have intrinsic 
conflict with digital modulation, thus mandating alter- 
nate approaches. 

In order to overcome this challenge, we have identified 
the Walsh functions as a mathematical basis that is com- 
patible with digital sequencing hardware. In this section 
we discuss the Walsh functions and the WDD derived 
from them. 



A. Walsh functions and WDD 

The Walsh functions are a family of binary valued (±1) 
piecewise-constant functions on the [0,1] interval |53) . 
They found a place in engineering in the 1960s, when 
they started being applied to problems ranging from com- 
munications and signal analysis to image processing, and 
noise filtering [481 E2] • The Walsh functions come in a 
variety of labeling conventions, including the Hadamard, 
"sequency", and Paley (or dyadic). In particular, the 
sequency ordering counts the number of "switchings" of 
the Walsh functions. In this paper, however, we focus 
on the Paley ordering, given in terms of the so-called 
Rademacher functions [54] , which are defined as 

Rj(x) = sgn [sin (2 J 7ra)] , j > 0, 

and correspond to periodic digital switchings between ±1 
over [0,1] with the "rate" 2 J . The Walsh function of 
Paley order n, W„(x), is then defined as [55] 

m 

W n (ar) = n^) 6 "' x£ [M, (6) 

3=1 

where we denote by (b m b m -i—bi)2 the binary represen- 
tation of n, that is, n = b m 2 m - 1 + b m - 1 2 m - 2 + . . . + h2°. 
The actual number, r, of Rademacher functions used in 
constructing a Walsh function, is the Hamming weight 
(number of non-zero binary digits) of n. For illustration, 
the Walsh functions {W„(a;)}^ 2 =1 in the Paley ordering 
are shown in Fig. [ija). The Walsh functions form an 
orthonormal basis over [0,1], that is, 

/ W n (x)W m {x)dx = S mn , (7) 
Jo 

where 5 mn is the Kronecker delta. Any integrable func- 
tion f(x) defined over [0, 1] has a convergent Walsh- 
Fourier expansion similar to the usual Fourier series: 

oo -i 

f(x) = J2a n W n (x), a n = / f(x)W n (x)dx. (8) 

This allows us to expand any sequence propagator in the 
Walsh basis. We note in passing that the Walsh basis 
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Figure 1: (color online) Walsh functions listed by Paley ordering, (a) The first 32 Walsh functions, (b) The normalized pulse 
locations, Sj, corresponding to the digital transition points of Walsh functions. Sequences WDD2»-_i are highlighted in both 
panels. 



is technically over-complete while the Rademacher func- 
tions form an orthonormal system [54] . 

We define the Walsh DD (WDD) sequences directly 
based on the Walsh functions: that is, for a given running 
time r, the sequence WDD„ is determined by the control 
propagator 



y(t)=W n (f/r), te[Q,T]. 



(9) 



Equivalently, we may specify the normalized pulse lo- 
cations for WDD„ as the switching locations of W„(a;), 
whereas the constant pieces of the Walsh functions cor- 
respond to interpulse delays. By construction, the num- 
ber of pulses used in WDD„ is given by the sequency s 
associated with W„. More explicitly, if in binary nota- 
tion n = (b m b m -i—hh and s = (flVnSm-i— Si)2, then 
9i = bi + &i+imod 2 [56], known as a Gray code. 

Both the total running time, r, and the minimum 
switching time, r m - m = t/2™, impose constraints on the 
accessible WDD sequences. First, we must have m — 
0(log 2 n), the number of digits in the binary representa- 
tion of n for a given WDD sequence. Once these param- 
eters are fixed, there is a maximum value n ss 0(r/r min ) 
for which WDD„ is viable. Reversing this argument, by 
focusing on a finite set of WDD„ and a fixed running 
time, we automatically accommodate a finite minimum 
switching time. 

Figure [ijb) depicts the pulse locations plotted as Sj for 
a normalized sequence duration r = 1 extracted from the 
W32 (t/r). Some familiar DD sequences are immediately 
identified: WDDi is the spin echo sequence and WDD3 is 
the two-pulse Carr-Purcell-Meiboom-Gill (CPMG) pulse 
sequence. We will return to these correspondences in 
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B. Error suppression properties 

A primary aim in the construction of DD sequences 
is to increase the order of error suppression, resulting in 
better cancellation of noise at sufficiently low frequen- 
cies [TO, 40, 46,(571 [58]. I n order to characterize the error 
suppression capabilities of WDD sequences, we begin by 
noting that the Walsh functions, constructed as products 
of r Rademacher functions [Eq. ^] satisfy the following 
important equality (see Appendix for an explicit proof): 
-1 

[R n (x) ■ ■ -R jr (x)}x k dx = 0, k = 0, . . . ,r - 1, (10) 

where the {jk} indices label the locations of the r non- 
zero digits in the binary representation of n. Equiva- 
lently, the monomials x k have no Walsh-Fourier compo- 
nent a n as long as n has a Hamming weigth of at least k. 
Let us focus on the Fourier transform of the propagator 
function for WDD„. Using Eqs. ^ and (|9| we have: 

v{ut)=t f W n (x)e iurx dx, 



k—r 



(iuiTxY 

kT~ 



dx. 



where the powers of x below r have been eliminated by 
using Eq. Together with Eq. the above equa- 

tion implies that 



WDD 



Xut) cx {lut) 2 ^ +1 \ 



(11) 
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Thus, WDD„ suppresses errors up to order r. Expressed 
differently, all WDD n derived from Walsh functions com- 
posed of r Rademacher functions exhibit the same order 
of error suppression. The relationship between r, the 
Hamming weight of the Walsh order n, and the order of 
error suppression is one of the main results of this work. 

The error-suppressing properties of WDD n can also be 
directly established upon obtaining a compact expression 
for the corresponding filter function (in analogy to [3D]), 
which is possible by using Eq. ^ in combination with a 
change of variable e %UT l' im i-> z. For n — (b m • • • 60)1, the 
filter function for WDD„ is given by 

rn 

fwdd„h= (l-^na + M)^ 2 " 1 ) 

3=1 

= 4 m+1 smVr min /2) 



n sin2 ( 2 



i-2, 



Jl cos 2 (2^ 2 ^r min ). (12) 



{j\bj = l} 



{j\bj=0} 



metric factors appearing in Eq. (12 1 have complemen- 



where the product chooses sin or cos factors based on 
the j-th binary digit bj of n. Interestingly, the trigono- 



tary implications for the filter function and ultimately for 
error suppression. The smallest common period among 
these terms is at WT m j n = 2tt. Each sine term is linear 
in lot at to — and each factor of sin (2 J ' — 2 &>T m i n ) sub- 
sequently contributes a factor proportional to (wt) 2 to 
the filter function. These combine to give -FwDDn (ut) oc 
(wr) 2 ( r+1 ), consistent with Eq. (111. Also, all sine terms 
have zeroes at WT m j n = kir for integer k, that correspond 
to zeroes at u> = 2 m fc7r/r. In contrast, they all carry large 
maxima occurring at uj = 0(T~ in ). The cosine terms on 
the other hand have no effect on the filter function around 
lj = 0, but drop to zero at somewhat higher frequencies 
(at which the sine terms might be significantly large). 
This suggests that they can reduce the spikes of the sine 
terms and contribute to increasing the bandwidth VL V . 

With the insights above, we find that sequences 
WDD2> _i have the lowest value of sequency (pulse num- 
ber) for a given order of error suppression (among the 
Walsh family). This is because the binary representation 
of a digital integer n = 2 r — 1 requires all 6, = 1. Figure 
[2] depicts the filter functions (in a log-log scale) associ- 
ated with WDD 2r -i for r = 1, • • • , 5 (as identified in 
Fig. [I]) , where we illustrate how the low- frequency rolloff 
is increased by increasing r. We compare the filter func- 
tions for these sequences to those of Uhrig DD (UDD), 
known to provide s-order error suppression given s pulses 

[Bang. 

In Fig. [2^b) we plot the filter functions for all WDD 
sequences with n < 32 and r = 4. We see that all have 
the same low-frequency rolloff, validating the assertion 
that the order of error suppression in a WDD sequence 
is determined by r. 

The actual coherence error, 1 — W, associated with a 
given DD sequence can be calculated by using the rele- 
vant power spectrum Sp(u)) for the noise in Eq. Bl). The 



presence of spikes in the filter function at u = 0(r~l n ) 
(not shown) implies that we can only expect error sup- 
pression as long as w c r min < 1. We can guarantee this by 
shrinking r m ; n through use of more frequent decoupling 
pulses, as far as the technological limitations on pulse 
rates permit this (see Ref. [44] for a discussion of the 
lower bounds on error suppression in DD at constrained 
minimum switching time). 

Fig. [2jc) depicts the performance of WDD 2 ^i se- 
quences for a specific illustrative case where al/w 2 noise 
power spectral density S(u>) persists up to a Gaussian 
high-frequency cutoff ui c (given in units of inverse time in- 
terval 1/t) . We have scaled the noise to have a value com- 
parable to that derived from the low-frequency behavior 
of nuclear spin diffusion in singlet-triplet qubits [SDJ . We 
observe, as expected, that not only does the calculated 
1/e coherence time increase with n, but the slope of the 
error accumulation at short times also increases with n. 
This is a manifestation of the increasing order of error 
suppression with r described above, and shows the util- 
ity of high-order WDD sequences for quantum computing 
applications where minimizing the error probability is of 
utmost importance 60 . 



C. The Walsh sequence suite 

It is clear from the previous discussion that a number 
of familiar integer-based DD sequences can be identified 
as special instances of WDD sequences. These sequences 
are described next for reference and are summarized in 
Table [T] For example, periodic DD involves repetitive ap- 
plication of uniformly spaced ir pulses lj . A Walsh func- 
tion composed of a single Rademacher function Rk(x) 
(r = 1) corresponds to a PDD with 2 r+1 — 1 pulses. 
CPMG sequences are modifications of PDD in which the 
first and last free-evolution periods are half the duration 
of the interpulse period. We refer to a CPMG sequence 
with n pulses as CPMG„ . A Walsh function of the form 
Rk(x)Rk-i(x), with k > 2, corresponds to r = 2 and 
CPMG 2fc . 

We also observe that the sequences defined by 
WDD2> _i, identified in the previous section, are in fact 
concatenated DD (CDD) sequences. The latter are 
known to allow arbitrary orders of error suppression in 
DD for arbitrary noise models by recursive embedding 
of a sequence in a "larger" one [H [ST]. As long as a 
first-order DD sequence can be implemented in a sys- 
tem, concatenating it with itself results in higher or- 
ders of cancellation, at the expenses of increased se- 
quence length. For a purely dephasing environment as 
examined here, the natural first-order DD sequence (the 
"base" for concatenation) is the well-known spin-echo se- 
quence, characterized by two equal intervals separated by 
a 7r pulse; each higher level of concatenation corresponds 
then to embedding the spin echo in a larger spin echo 
[331 136] . Using induction, we can show that each con- 
catenation level corresponds to multiplying the sequence 
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Figure 2: (color online) Error suppression properties of WDD sequences, (a) Filter functions calculated for sequences derived 
from WV-i(i/i~) and corresponding to CDD r , with fixed total sequence duration, r. For each value of r, the number of 
fundamental Rademacher functions that are needed to produce the sequence increases, and the order of error-suppression, 
demonstrated graphically by the low-frequency rolloff of the filter function, increases by 6 dB/octave. Dashed lines correspond 
to the UDD sequence providing the same order of error suppression, (b) Filter functions for all WDD sequences with n < 32 
and r — 4. UDD4 is represented by a dashed line and WDD15 by a solid line. Dashed lines correspond, left-to-right, to 
WDD23, WDD27, WDD29, WDD30. (c) Decoupling error as a function of dimensionless frequency normalized by the cutoff 
frequency, uj c in Sp(uj) — au~ 2 exp [— (u/u) c ) ]. Here, a = 5x 10 16 , and frequency ranges are set as in Ref. |59| . Sequences are 
same as in (a), and increase in n from left to right. 



n 


WDD n 


Number of Pulses 
(Sequency) 


Rolloff 
(dB/octave) 


T 


PDD 


2 r+i _ 1 


6 x (1 + 1) 




CPMG 


2 r 


6 x (2 + 1) 


r - 1 


CDD 




6 x (r + 1) 



Table I: Familiar sequences among WDD n . \x~\ denotes the 
ceiling function, that is, the smallest integer not less than x. 
The exact expression for the number of pulses in CDD was 
obtained with the aid of 1621. 



propagator by a Rademacher function. This results in a 
product of all Rademacher functions up to order r, thus 
b\ = . .. = b r = 1, which corresponds to WDD 2 r_i. In 
summary, 



CDD r o Ri(x) • • • R r (x) o WDD 2 r_i. 



(13) 



Again, the WDD sequences that correspond to CDD have 
a propagator corresponding to a product of Rademachers 
of all orders from 1 to r, giving r-order error suppression 
with the lowest value of sequency, s. 

The WDD family significantly enlarges the sampling 
set of DD sequences relative to the much more con- 
strained CDD. If the minimum switching time is con- 
strained to T min = t/2"\ then all the 2 m WDD„ se- 
quences with n = 0, • • ■ , 2 m — 1 are viable. In contrast, 
the subset of viable CDD sequences for the same fixed 



total time contains only m sequences, an exponentially 
smaller number. (However, it remains an even smaller 
subset of all possible digital DD sequences, which con- 
tains 2 2 sequences. This potential advantage will be 
discussed in Sec. fv} ) 

WDD thus forms a unifying mathematical framework 
for the generation of digital DD sequences, including 
many familiar sequences and a large variety of novel se- 
quences. Using insights derived above, we may fully char- 
acterize the structure of arbitrary Walsh functions, and 
therefore arbitrary WDD modulations. By the structure 
of the Walsh functions themselves, all WDD sequences 
can be produced recursively from free evolution by com- 
bining the following two intuitive operations: 

• Repetition, where the sequence propagator y(t) is 
repeated identically to produce a longer sequence: 



y(t) 



!/((*- t)/2), 



t < t/2, 

t/2 < t < T. 



• Concatenation, where the sequence propagator y(t) 
is repeated with the sign reversed, still yielding a longer 
sequence: 



y(t) >-> 



y(t/2), t < r/2, 

-f/((i - r)/2), r/2<t<T. 
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The actual implementation of repetition and concate- 
nation may involve inserting a tt pulse in the mid- 
dle of the formed sequence to account for the re- 
quired sign change at t = r/2. In general, WDD2„ 
is constructed by repeating WDD„, whereas WDD 2n +i 
is constructed by concatenating WDD„. For in- 
stance, WDD 30 = (WDD 15 )(WDD 15 ) [repetition] and 
WDD31 = (WDDi 5 )7r(WDDi 5 ) [concatenation], but 
note the reversal of the role of the middle pulse in 
WDD M = (WDD 7 )7r(WDD 7 ) [repetition] or WDD15 = 
(WDD 7 )(WDD 7 ) [concatenation]. 

Each concatenation increases the rolloff slope by one 
order (by contributing an additional sine factor in 
^wdd„)j whereas each repetition does not improve the 
rolloff (a cosine factor) and may instead increase the 
bandwidth of the filter, This produces a diversity 
of design features that can be used to improve coherence 
times (e.g. the l/e decay time denoted T2) in addition to 
coherence values (or, equivalcntly, error rates) by choos- 
ing the appropriate Walsh basis function. 



IV. BEYOND CLASSICAL PHASE NOISE AND 
QUANTUM MEMORY 

In this section, we consider different extensions of the 
idea of Walsh modulation to control settings more general 
than examined thus far. 



A. Quantum phase noise 

Decoherence in a quantum system is most generically 
described by the interactions with a quantum environ- 
ment. We focus first, as before, on a single qubit (see 
below for multi-qubit extensions). In the absence of con- 
trol and an internal system Hamiltonian, evolution under 
the noise Hamiltonian of Eq. ([I]) is then replaced by evo- 
lution under an open-system dephasing Hamiltonian of 
the form 



H — Hsb + H 



H; 



SB 



B 7 



where physically Hsb represents the interaction term, 
and B z and Hb are generic operators acting on the en- 
vironment, respectively. Typically, Hsb causes entan- 
glement between the system and the environment, which 
eventually results in loss of phase coherence and mixed 
qubit states. 

The WDD sequences can be applied to the above gen- 
eral quantum dephasing scenario. Under a sequence 
of 7r pulses, the evolution is identically generated by a 
piecewise-constant Hamiltonian ±cr z ® B z + Hb- Here, 
the filter function formalism is not exactly applicable in 
general. An important exception, as noted, is provided 
by the case of a bosonic bath [TJ [TO] 03] , and results accu- 
rate up to the second order in Hs b have been established 
in [63 for an arbitrary quantum dephasing environment 
under periodic DD (see also |64| for exact results on more 



general classical noise models). Nonetheless, a descrip- 
tion based on an effective Hamiltonian and the Magnus 
expansion can be used to approximate the evolution of 
the system provided that \\B Z \\ (operator norm of B z ) is 
sufficiently small. Making note of the concatenated and 
repetitive structure of WDD and following Ref. [61] , we 
can show that the norm of the effective Hamiltonian for 
the evolution under WDD„ is given by: 



I^wddJI = 0[\\B z \\r r max(\\B z \\, \\H B \\Y 



(14) 



where the strength of the bare interaction \\B Z \\ is scaled 
by a factor proportional to r r , with r denoting, as be- 
fore, the Hamming weight of n. This implies that the 
expansion of the propagator for the evolution in r starts 
with the power r r+1 . The fidelity loss can be defined 
as a distance between the ideal state (original state in 
DD) and the actual propagated state of the system at 
the end of the evolution (after tracing out the environ- 
mental degrees of freedom) |65j . In the limit of small ac- 
tions TifyvDD,,: the fidelity loss scales with HtHwdd,, || 2 
or, equivalently, the qubit fidelity loss scales at worst with 
T 2 ( r + 1 ). While this result mirrors the error cancellation 
properties of WDD in the filter function formalism [Eq. 
(Ill], we emphasize that asymptotic relationships such 



as Eq. ( 14 ) must be understood as scaling laws that may 
hide (possibly large) r-dependent prefactors. This fact 
highlights the importance of Eq. Q as an exact expres- 
sion for Gaussian phase noise for which a counterpart 
does not exist in a general quantum-dephasing setting. 



B. Generic decoherence 

As already mentioned, DD can be applied to a large 
class of open quantum systems: a control propagator 
that regularly traverses the elements of the so-called DD 
group in principle allows suppression of arbitrary non- 
Markovian decoherence, including multi-qubit noise in 
many-qubit systems |2J. When the noise operators are 
decomposed into algebraically independent components, 
one may reconstruct the generic decoupling procedure 
through concatenation as well [61]. While the group- 
theoretic DD design is applicable only to first-order error 
suppression, concatenation allows us to combine DD se- 
quences for different noise axes to produce a high order 
and generic decoupling procedure. This idea is the basis 
of a variety of recent DD schemes, such as quadratic DD 
(QDD) [55] and its multi-qubit variants [5T1 167) . 

The WDD sequences can be similarly concatenated 
along different axes to allow suppression of general noise. 
For example, consider a generalized classical noise Hamil- 
tonian acting on a qubit: 



H„ 



/3 x (t)a x + P y {t)o- y + z (t)a z , 



where f3i(t) are stochastic processes. Applying a WDD 
sequence with o~ x pulses effectively removes the f3 y (t)o~ y + 
fiz(t)o~z noise component while it leaves P x {t)o~ x intact. 
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Embedding this sequence itself in a WDD sequence with 
a y (or a z ) pulses removes the remaining f5 x {t)cr x compo- 
nent as well. We can generalize this idea to define generic 
Walsh dynamical DD (GWDD) for a qubit. The control 
propagator U c (t) for the sequence GWDD ra is given by 



U c (t) 



- n Mt)+n/2 [y(t)+l]/2 



where 



x(t) = R h (t)R h (t) 
y(t) = R ja (t)R i4 (t) 



' I R j2r-l(*)l 

RjvW, 



and, as in the single- axis case, the {j^} indices label the 
locations of non-zero digits in the binary representation 
of n. In practice, the final sequence corresponds to ap- 
plying o~ x or a y pulses at rates given by the the corre- 
sponding Rademacher functions, taking into account the 
algebraic simplifications such as a x o~ x = I or a x o~ y = a z 
and ignoring all the resulting ± signs. We note that for 
n = 2 2r — 1, GWDD„ reproduces the generic CDD se- 
quence of level r described in Ref. [5] , while repetitions of 
GWDD,, include truncated periodic CDD protocols (so- 
called PCDD) such as investigated in [BE] 69 ■ Again the 
Walsh functions form a unifying mathematical basis for 
sequence construction, reproducing familiar decoupling 
protocols. 

For a multi-qubit system where qubits interact with 
the environment linearly ^ [T3] , GWDD is also applica- 
ble, except that each unitary pulse operation has to be 
replaced with a collective version that affects all qubits 
simultaneously and equally. We remark in passing that 
the idea of Rademacher products (concatenation in par- 
ticular) can in principle be extended to generic finite- 
dimensional control systems provided that the role of a x 
and <j y is replaced with the generators of the correspond- 
ing DD group [70] , 



C. Non-identity operations 



where Eq is the error per gate (or error action) asso- 
ciated with Q and the goal is to minimize Eq for any 
desired Q. The basic intuition is to use the separation 
of the ideal gate action and the error to mix and match 
error parts so that altogether the errors cancel out in a 
perturbative manner. 

DCGs have algebraic connections to DD sequences but 
they remove the need for instantaneous ideal pulses and 
are used to implement non-identity unitary actions on 
the system. The general theory for constructing DCGs 
to higher orders for arbitrary systems appears in |14) . 
but for our purpose we focus on an example that applies 
to a recent experiment on correcting errors due to laser 
frequency jitter in a multi-qubit entangling gate mediated 
by laser light [50] . 

Consider first the spin-echo sequence XfXf, where X 
denotes an ideal a x gate and / a free evolution interval. 
The latter can be interpreted as a primitive implementa- 
tion of the identity gate /. Thus, we may write the spin 
echo as 



/(i) = xi^xi^K 



(15) 



where J*- - 1 refers to a zeroth-order approximation for the 
identity action through free evolution, and refers in- 
stead to an improved approximation of the identity ac- 
tion with a higher order of error suppression. By recur- 
sively applying Eq. (151, or by repeating an improved 



identity gate, we obtain the WDD sequences correspond- 
ing to progressively better and longer approximations of 
the identity gate. Consider next a specialized scenario in 
which the X gates are still ideal but, instead of free evo- 
lution, we apply a target gate Q with an associated error 
Eq. We also assume that the intended gate action Q 
and the X gates commute. We begin with the sequence 
XQXQ whose actual propagator is given by 



XQexp(— iEQ)XQexp(— iE Q ) — 
Q 2 X exp(-iE Q )X exp(-iE Q ), 



(16) 



Our focus so far has been on error suppression while 
preserving arbitrary quantum states, hence effectively 
implementing the identity operator with higher fidelity 
than free evolution for a given duration. The periodic 
properties of the filter functions associated with WDD 
sequences also allow us to achieve suppression of fre- 
quency noise while effecting non-trivial (non-identity) 
quantum gates, close in spirit to dynamically corrected 
gates (DCGs) [TMl]- 

The starting point in DCG constructions is the sepa- 
ration of the action of a gate on a noisy system into ideal 
and error parts, both represented as unitary operators. 
Concretely, let Q and Uq denote the ideal unitary gate 
that is to be implemented and the actual unitary prop- 
agator corresponding to the evolution during the control 
that aims to implement Q, respectively. We can write 

U Q = Qexp(-iE Q ), 



where X exp(— iEo)X exp(— iEo) can be interpreted as 
a spin-echo sequence for free evolution with an effective 
Hamiltonian Eq. Thus, if Eq consists of terms that an- 
ticommute with X (i.e. XEqX = —Eq), we expect that 
the sequence XQXQ suppresses error in a manner sim- 
ilar to XIXI, except that the resulting action will now 
be the non-trivial gate Q 2 . Furthermore, we may con- 
catenate or repeat this sequence to obtain higher-order 
suppression of errors at the expense of a longer gate se- 
quence. 

The model which we just described for a simple high- 
order DCG applies to the experimental setup in Ref. |50) . 
in which a spin-motional entangling gate is applied to 
trapped ion qubits. The fundamental concept of this 
gate is that by state-selectively exciting the harmonic- 
oscillator motional modes of ions in a shared trapping 
potential, it is possible to entangle the internal spins of 
the qubits. The actual unitary propagator associated 
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with a single gate is given by 



U Q (t) 



Q. 



(17) 



where Q is a geometric phase gate, described above, that 
entangles the qubits. This gate relies on disentangling 
the spin and motion at the end of the gate by detuning 
the driving force from a motional resonance by 5 and 
setting the gate time t = j2ir/5, where j is a positive 
integer. In this case the drive and motion desynchronize 
at t, and one precisely implements Q. Residual spin- 
motional entanglement corresponds to an error and we 
thus define the error per gate as 

—iEq = Sn [a{t)a) — a*(t)a\. 

The coefficient a(t) is the time dependent displace- 
ment due to the laser detuning 5, that is, a(t) — 
(fi/2) J dsexp(-iSs). Such error may be due, in prac- 
tice, to the fact that 6 typically carries an additive fre- 
quency error A for which we have 



a{t) 



/ risexp[— i(5 + A)s]. 
Jo 



In this case we find that the harmonic oscillator does not 
produce a "closed loop" in phase space, thus yielding an 
error due to residual spin- motion entanglement. 

Our goal is to achieve an improved approximation of 



the entangling gate in Eq. (17) by cancellation of a(t) to 
a high order at the end of the evolution. Besides varying 
the detuning <5, the evolution of a(t) can also be con- 
trolled by flipping the phase of the laser-mediated optical 
dipole force used to drive the ion motion (a nearly instan- 
taneous action), which effectively corresponds to switch- 
ing the sign of the interaction. Consider the evolution 
of the system punctuated by such phase flips occuring at 
the times p — with to — and i n +i = r to de- 

note the total gate/evolution duration. The expression 
for a(r) is then given by 



O 

3=0 



and we note that 

rtj+i 

dsexpf— i(S + A)s] = 



ds exp[— i(S + A)i 



-i(S+A)t s _ e -i(S+A)tj+i 

S + A 



In reality, the mismatch A may follow a probability dis- 
tribution P(A) and we can write the expected value of 
|q(t)| 2 as 



HrW 



4 



where we have used the expression for the filter func- 
tion in Eqs. |2]) and This expression resembles the 
equation for the decoherence error associated with a DD 
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Figure 3: (color online) Noise suppression at fixed frequency. 

(a) Filter function about normalized frequency 8 on a semilog 
scale for various WDD sequences, resulting in a notch where 
noise is suppressed. The notch bandwidth increases with se- 
quency and the order of error suppression about S increases 
with r. Inset: Same filter functions plotted on a linear scale. 

(b) Filter function normalized by cj 2 for WDD15 and WDD31. 
The solid thick line shows WDD 15 at 8/2% = 2 5 , where the 
horizontal axis represents detuning from 8. Here the order of 
error suppression as a function of detuning has increased by 
one order and is comparable to that for WDD31 near zero 
frequency. See text for details. 



sequence applied at the same times, except that the ar- 
gument of the filter function in the integrand is shifted 
by a value S, and instead of the power spectrum Sp(uj) 
the error probability density P(A) is used. To cancel the 
effects of A we require the filter function to have a zero 
of high multiplicity at u> = 5. 

Such a zero may be realized in the filter function of the 
WDD sequence; from Sec. |III B| we recall that the fil- 
ter function F(ujt) has a natural period (translational 
invariance, see Fig. [3]) at T m j n /27r, corresponding to 
r/(2 m+1 7r). This produces the requisite zeroes in the 
filter function at fixed values of 5, a feature intrinsic to 
the formulation of WDD. 

In order to gain a quantitative understanding, let us 
calculate the Fourier transform of the sequence propaga- 
tor, 



jr(wr) = / dt\N n (t/r)e^ +A ^ T . (19) 
Jo 

We extract the filter function's dependence on A near 8 
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by use of an equality that is similar to Eq. ( 10 ) 



dx Rfc (x) e 
{k} 



;2 r 



i=0 



(20) 



where the set {fc} has r elements in it, k > 1 and 
max(fc) < r. A sketch of the proof of Eq. (20) appears 



in the Appendix. If the frequency 5/2ir — 2 k Jr, then Eq. 
( 19 1 can be written as 



y{uJT) = 



d£ Y[ R (k, t') e l 

{k} 



OO 

E 

rn— r+1 



(zA'f') T 
m! 



(21) 

where we have defined t' = t/r and A' = At. This 
expression indicates that y(u>r) scales like A r+1 in the 
vicinity of S. Thus, the slope of the rolloff as a function 
of detuning is set by r, giving increasing robustness to 
fluctuations in detuning from S. For fixed r, the band- 
width of the "notch" increases with sequency, as does the 
equivalent il p . These properties are expressed in Fig [3] 

While the filter function is translationally symmetric, 
we see that the form of filter performance around S is 
not the same as it is around uj = 0, due to the factor of 
w~ 2 appearing in Eq. [2] An expansion as a function of 
A therefore yields improved performance relative to the 
zero-frequency rolloff when accounting for this factor. 

It thus follows that for the specific case treated here 
by flipping the laser phase at times corresponding to the 
WDD„, the error in the detuning S can be suppressed up 
to order r+1, where as before r is the Hamming weight of 
n. This general approach was studied experimentally in 
Ref. [50]. Interestingly, this sequence can be interpreted 
as a high-order DCG where the primitive gates Q are 
defined as T m ; n -long periods of evolution at detuning d = 
27r/r m i n and the X gates are implemented by phase flips. 
We can interpret all the resulting DCGs generated by 
WDD sequences as repetitions and concatenations of the 
basic sequence XQXQ, resulting in a corrected gate Q 2 
which is the desired entangling operation but has an error 
that scales with A 2 ( r+1 l 



V. BENEFITS OF WDD OVER OTHER 
OPTIMIZED APPROACHES 

Interest in the DD community has recently focused on 
optimized pulse sequences producing high-order suppres- 
sion of noise either through analytical approaches (e.g. 
UDD [TP]), numerical optimization (e.g. Locally Opti- 
mized ODD (LODD) PO], Optimized noise Filtration DD 
(OFDD) @g, Bandwidth Adapted DD (BADD) 03), or 
combinations of optimization and concatenation strate- 
gies (e.g. concatenated UDD (CUDD) [71], QDD [BBJ, 
Nested UDD (NUDD) [51 ). These sequences bring many 
benefits in terms of resource-efficient DD, as they gener- 
ally optimize error-suppression against pulse number. 

Studies have shown, however, that the extraordinary 
benefits provided by optimized sequences are largely sup- 



pressed in the presence of realistic constraints such as 
imperfect control pulses [HI [T2J, [73] , digital clocking [16] , 
and timing limitations [43] . Moreover, noise suppression 
benefits have been shown to be minimal for Sp(uj) dom- 
inated by low-frequency noise and exhibiting slowly de- 
caying high-frequency tails [40] [42j [74] . 

The WDD sequences possess benefits over existing ap- 
proaches along two primary metrics: 

• Efficient hardware sequencing; 

• Restricted search space for sequencing. 

We will separately explore each of these benefits next. 



A. Efficient hardware sequencing 

Current experiments in DD employ a user- 
programmed microprocessor and a complex hardware 
chain (e.g., a signal generator controlled by a pro- 
grammable logic device responsible for pulse timing and 
under PC control). This is appropriate for demonstra- 
tion experiments, but fails to provide a scalable solution 
due to both sequencing challenges and the difficulty of 
input-output (I/O) in complex or quantum systems. We 
are therefore interested in finding solutions permitting 
all sequence generation to be performed at the local 
level. 

Once we accept this consideration, our metrics of ef- 
ficiency change relative to the majority of published lit- 
erature. When considering optimized DD, sequencing is 
complex and will likely require either a local micropro- 
cessor to decode instructions and apply a DD sequence, 
or multiple high-bandwidth communication pathways to 
external controllers. Both situations pose challenges in 
terms of significant local power dissipation in control 
hardware and energy inflows associated with I/O path- 
ways. The energy expended in the number of pulses ap- 
plied may contribute only a small amount of the total lo- 
cal power dissipation given these considerations. Given a 
presumed need for local DD sequence generation we thus 
arrive at digital sequencing complexity as the relevant 
metric for efficiency. 

WDD meets this challenge, its primary benefit being 
that the Walsh functions are easily produced using sim- 
ple digital circuits 48] . We identify Rademacher func- 
tion generators (square-wave generators) as basic hard- 
ware resources for the physical-layer implementation of 
dynamical error suppression, realizing the control prop- 
agator in real-time. Rademacher functions may be gen- 
erated in hardware with relative case from a distributed 
clock signal, and may be added/multiplied together via 
hardware logic to generate Walsh toggling frames (for in- 
stance Harmuth's array generator |48j ) . The number of 
Rademacher functions required to achieve a given error 
suppression may therefore be deemed a relevant quanti- 
tative means of establishing the sequencing complexity. 

As we have shown, a WDD sequence derived from 
r Rademacher functions cancels the first r — 1 orders 
of dephasing noise, and the sequences derived from 
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\A/2>-_i(x) = CDD r do so for the smallest value of 
sequency (for fixed r). WDD therefore provides the 
highest-possible order of error suppression with respect 
to the number of Rademacher functions required to im- 
plement a given DD protocol, providing efficiency in the 
sequencing complexity. For the same sequencing com- 
plexity r, using sequences with different n permits mod- 
ification of rip, giving flexible control over noise filtering 
capabilities. By contrast, other optimized sequences dis- 
cussed above require substantially greater sequencing re- 
sources, as the Walsh-transformation of a sequence such 
as UDD indicates the need for a large r in order to gen- 
erate the requisite {dj}. 

Furthermore, the structure of WDD and Walsh func- 
tion generation is compatible with a nested or concate- 
nated structure of decoupling about multiple axes, which 
enables the generation of GWDD sequences for generic 
decoherence. In the special case of a large separation 
of timescales for different decoherence processes, GWDD 
also allows for efficient bit-stacking of Walsh functions de- 
rived from similar hardware, but executed with different 
time-bases. Finally, we note that certain sequences con- 
structed by recursively repeating the same base sequence 
many times may find value in circumstances where long- 
time storage and low-latency memory interrupts are de- 
sired in addition to improved low-frequency error sup- 
pression. This topic will be addressed in a separate 
manuscript. 

Once a given Walsh function (or set of Walsh 
functions) is generated, conversion to WDD in real 
time may be achieved via hardware differentiation 
or edge-triggering of separate circuitry producing pre- 
programmed control pulses. Overall complexity is re- 
duced by separating control sequencing from pulse gen- 
eration. 

Using Walsh functions for generation of the control 
propagator and hardware techniques for the triggering of 
applied pulses therefore provides a means to efficiently 
realize dynamical error suppression at the local level. 
Through this approach the need for access to a user- 
controlled microprocessor in order to send pulse-sequence 
commands to hardware, or for a local processor to inter- 
pret externally generated commands may be obviated: 
all control and timing may be performed using relatively 
simple digital logic realized in an Application Specific In- 
tegrated Circuit. A system protected by WDD sequences 
may run semi-autonomously, implementing pre-selected 
WDD sequences, or it may be programmed externally, 
where the only communicated information required could 
be the value of n in binary (or as Gray code) , a repetition 
number, and a trigger signal starting the sequence. 

A significant challenge in such schemes is suppression 
of spurious rising/falling edge signals arising from, say, 
propagation delays in digital circuitry. However, latching 
WDD sequences to the clock signal should aid suppres- 
sion of these signals. Accounting for nonzero-duration 
control pulses with r w an integer multiple of the clock pe- 
riod, may also be achieved through use of latching logic 



to introduce hardware delays while the control operations 
are applied. However, the effect of such delays on WDD 
performance would need to be characterized in detail. 
The creation of an optimized hardware-based generator 
of WDD sequences remains a problem for future study. 



B. Restricted search space 

While there already exists a plethora of DD procedures 
within the bang-bang limit, in practice not all these pro- 
cedures will be suitable to the operating range of physical 
systems or the demands of quantum information process- 
ing protocols. This is true despite the fact that the ob- 
jective in all such procedures is identical: maximize the 
fidelity of preserving an arbitrary quantum state for a de- 
sired time. For sufficiently simple experimental systems, 
an empirical optimization procedure (such as LODD) can 
sample over the space of all permissible DD pulse se- 
quences and search for an optimal one. This idea has 
also been explored numerically in the above-mentioned 
OFDD and BADD protocols, as well in optimized DD 
for power-law spectra [73]. These approaches grow sig- 
nificantly in complexity as the pulse number increases, 
for instance in a large scale system requiring long-term 
storage. The central idea in WDD - that sequences us- 
ing digital timing are made by attaching or repeating 
smaller sequences recursively - can be instrumental in 
significantly reducing the search space for finding opti- 
mal DD sequences empirically or numerically. We briefly 
outline potential implications of this idea here. 

The canonical model of single-qubit dephasing with 
a given noise power spectral density used through most 
of this paper retains considerable structure in choosing 
when to apply the X pulses. This persists even when en- 
forcing digital timing conditions as below. Given TV = 2 m 
time-bins, each of duration r min , one may elect to ei- 
ther apply or not apply a pulse, thus generating a space 
of 2^ = 2 2 ™ = 2 2T/T " 11 " 1 possible sequences. Perform- 
ing a complete search over this space becomes extremely 
challenging as the total duration r of the DD procedure 
grows, as observed in numerical approaches to generat- 
ing randomized DD protocols [3H1 US] and to optimized 
sequences [2"U1H2"] . 

WDD provides a natural means to reduce the search 
space for the generation of dynamical error suppression 
sequences as there are only N = 2 m = 2 T ' Tmta WDD 
sequences within the operational constraints of the prob- 
lem. It also provides an intuitive analytical framework 
allowing pre-selection of certain sequences within WDD, 
further reducing the search space. For instance, a search 
might exclude all WDD sequences with r < r m ; n , as 
they will be known to provide insufficient low-frequency 
noise suppression. With such constraints, and the rela- 
tive simplicity of implementing WDD in hardware, these 
sequences become especially attractive for an empiri- 
cal search in which the performance of each sequence is 
tested in an actual experimental setting. 
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Perhaps even more interesting are the options offered 
when Walsh DD is extended to generic decoherence of a 
larger set of qubits in a quantum memory. Recently, an 
efficient perturbative DD procedure for this task has been 
explored (NUDD) that utilizes {D 2 s ) r pulses for rth order 
decoupling of a generic D$ — 2™« -dimensional system of 
n q qubits [51 j - This procedure assumes the most general 
form of errors, including many-body errors. However, 
in reality error models are generally sparse. A funda- 
mental problem of DD of many qubit systems (closely 
related to quantum simulation algorithms [701 EH 177] ) 
is to optimize the decoupling when the error model for 
the interaction of the qubits is locally sparse. Using the 
procedure described in Sec. |IV B[ we can progressively 
produce a search space for such an optimization proce- 
dure starting from elements of a basic DD sequence and 
recursively building longer sequences by the WDD con- 
struction procedure (repetition/concatenation). 



VI. CONCLUSION 

In this manuscript we have studied the problem of 
reducing sequencing complexity in a dynamical error 
suppression framework, through consideration of digital 
modulation schemes. Towards this end we have intro- 
duced the Walsh functions as a mathematical basis for 
the generation of dynamical error suppression sequences. 
We have revealed how Walsh dynamical decoupling natu- 
rally incorporates familiar sequences (e.g. PDD, CPMG, 
CDD), and examined the properties of all the possible 
recursively structured sequences corresponding to Walsh 
functions. 

Our analysis has demonstrated that the order of er- 
ror suppression achieved by a given WDD sequence is set 
by the number of elementary Rademacher functions re- 
quired to generate that sequence. This is manifested as 
a scaling of the low-frequency rolloff in the filter function 
as 6(r + 1) dB/octave, with r the number of Rademacher 
functions appearing in the sequence. Meanwhile, the 
high-frequency performance of a WDD sequence, and 
hence its noise-suppression bandwidth, is tunable via se- 
lection of n for a given r, providing significant flexibility 
in sequence construction. Further we have introduced a 
simple means to construct WDD sequences using con- 
catenation and repetition, and a technique to suppress 
general errors via GWDD. 

We have shown that the Walsh functions provide effi- 
cient performance against control complexity, quantified 
by the value of r, and the supporting sequencing hard- 
ware. We believe that sequencing complexity will serve 
as a useful new metric with significant weight in system- 
level analyses, going far beyond standard optimization 
over pulse number in a DD sequence. 

These considerations are likely to make the Walsh 
functions an attractive framework for the development 
of a quantum memory incorporating hardware-efficient 
physical-layer error-suppression strategies. Interestingly, 



the problem of searching for (digital) DD sequences with 
high order error suppression properties is also related to 
finding Littewood complex polynomials with high order 
zeroes |78l 179) , which may point to further connections 
with both signal processing theory and polynomial anal- 
ysis and approximation. From a practical standpoint, 
the variety of WDD sequences realizable through sim- 
ple hardware sequencing provides a flexible solution for 
many future experimental systems. 



Appendix A: Order of error suppression 

In this section we prove that for a given Paley ordering 
r, as long as i < r: 



Ri(aO' 



• RjV-t-i {x)x l dx 



0. 



(Al) 



where j\ are non-zero positive integers in increasing or- 
der. This identity can be used to show the error suppres- 
sion of the WDD sequences directly in the time domain 
and can be mathematically interpreted as the vanishing 
of the moments of the Walsh functions on the unit inter- 
val. 

We proceed by induction on r starting with the base 
case, r = 0: 



f 
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Rj 1 {x)dx = 0. 



(A2) 



Since j\ > 0, the corresponding Rademacher function Rj t 
will be periodic and balanced on [0,1], which validates 
the base case in Eq. ( A2 1. 



For the inductive step, let us assume that 



R 3l (x). 



■Rj r+1 {x)x l dx 



0. 



where i < r. We need to prove that 



Rj 1 (x) ...Rj r+1 (x)Rj r+2 (x)x' l dx = 0, 



(A3) 



(A4) 



where i < r + I. 

The sign of Rademacher function Rj(x) is determined 
by the j-th binary digit of x which we refer to as bj{x). 
When i = 0, Eq. (A4) reduces to the average (zeroth 
moment) of the product of Rademacher functions over 
the interval [0,1]. We give a probabilistic argument 
for the cancellation of this average which can also be 
proven using induction. The average of the product of 
the Rademacher functions can be written as 



(-lf ) lW...(_l)^ + 2W ( ( x . 



(A5) 



We can interpret the above integral as an expectation 
value (denoted below by E) of a function of a real random 
variable x distributed uniformly over the interval [0,1]. 
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The corresponding (Lebesgue) probability measure dx is 
equivalent to products of independent and uniform (dis- 
crete) measures f or ea ch binary digit of x [SO]. We can 
thus rewrite Eq. ( A5 1 in the form 



E[(-l) b n ■ ■ ■ - E[(-l) b ^} ■ ■ ■ E[(-lpr+2 



0. 



We can then use the i — case as the base for another 
induction proof over i, where we assume that Eq. (A4) 
holds for all i' < r. We refer to this as the "inner in- 
duction" assumption and proceed to prove Eq. (A4) for 
i = r + 1, which we rewrite as 



I = 



\N n (x)R h Or) x r+L dx. 



(A6) 



Note that we have singled out the lowest order 
Rademacher function in the product and written the 
product of the rest as a Walsh function with Paley or- 
dering n. We can rewrite the integral as 



2 -3l 



2x2"" 



2x2-31 



■ }\N n {x)x r+1 dx, 



where the integrals match the positive and negative val- 
ues of the Rademacher function Rj 1 . 



Making the substitution x' — x — 2 31 for the negative-signed terms, we can write 



/ = 





/•3x2~ J 'i 




/ + '") 

^2x2-31 ' 



(A7) 



The Walsh function W„ is constructed entirely of Rademacher functions with indices larger than j\ having periods 
that commensurate with 2 _J1 , which results in \N n (x + 2 - -? 1 ) = \N n {x). This allows us to write 





/•3x2- J i 




/ + '") 

^2x2-31 / 



\N n {x)[x r+1 
I 



(x + 2- ]1 ) r+1 ]dx. 



(A8) 



We can rewrite the above as an integral over [0, 1] using 
the kernel (R h (x) + l)/2: 

/ = f W»(x) R3l(x) + V +1 " (x + 2-^ +l ) r+l ]dx 1 
Jo 2 

where the leading x r+1 powers cancel when we expand 
(x + 2~ J1+1 ) r+1 , leaving us with an integrand in which 
every power of x appears with an exponent i' less than 
or equal to r. This matches the assumptions of our inner 
induction, leading to the cancellation of the integral for 
i = r + 1 and thus establishing the main result in Eq. 

We also note the following result, related to the WDD 
filter function at higher frequencies and used in Sec. |IV C| 
of the main text: 

/ Ri(a;)---R jV (^)e i2 "^ = 0, (A9) 
Jo 

where i < r and we have used the highest Rademacher 
function in the product with a trigonometric function 
of the same period. This result can also be proven by 
induction in a similar manner. 



It is interesting to note that the only properties of the 
Rademacher functions that are used in our proof are their 
"frequencies" and symmetry. Thus, any family of func- 
tions that mimics the periods and the sign changes of the 
Rademacher function family could in principle be used in 
our proof. 
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